The best choice problem for random number of objects with a refusal probability
نویسندگان
چکیده
In this note, wewill consider the best choice problem for random number of objects with a refusal probability. There aremany papers and books on this subject fromChow/Robbibs/Siegmund(1971), Siryaev(1973) downward. These problems has been close relations with the martingale theory, the potential theory, the optimality principle in the mathematical programming. And now it is derived the optimal rule by using the sum-the-odds theorem. Refer to Ferguson(2006), Bruss(2003), Tamaki(2001), Hsiau/Yang(2000) etc. It would be prefered to call it the odds-theorem rather than the secretary problem or the best choice nowdays. Here we adapt a classical analysis to obtain its optimal value for the problem on Markov processes. Firstly the formulation of stopping problem forMarkov processes are given and its optimal equation of the value are designated. Since the secretary problem has a special transition probability, we show the optimal value is calculated as One-Look ahead policy or monotone stopping problem. Then it is treated asymptotically and solve the value in the explicitly form by a differential equation. Some equivalence relation is shown between the decision-theoretic form of the optimal equation and the transition probability with refusal. In the last, by a translation from the case of random number of objects to the standard form of optimal stopping problem, we could obtain the optimal value in the cited case of title. Also a variation for the selection is considered. 1 Formulation and Notations The optimal stopping problem is a special case of Markov decision processes and the decision maker can either select to stop so as receiveing rewards, or to pay costs and continue observing the state. Let xn, n = 0, 1, 2, · · · be a Markov Chain with a transition probability P = P(i, j), i, j ∈ S over a state space S in R1. We assume that S is countable, but this is inessential and for our discussion so that the scaling limit case of problems with the random numbers can be treated. For a stopping time τ, the aim is to maximize the expectation of payoff v(i; τ) starting at i and its optimal value v(i) defined by v(i) = sup τ<∞ v(i, τ) = sup τ<∞ E [ r(xτ − τ ∑ n=0 c(xn) ∣∣∣ x0 = i ] , i ∈ S (1.1)
منابع مشابه
Optimal Stopping Policy for Multivariate Sequences a Generalized Best Choice Problem
In the classical versions of “Best Choice Problem”, the sequence of offers is a random sample from a single known distribution. We present an extension of this problem in which the sequential offers are random variables but from multiple independent distributions. Each distribution function represents a class of investment or offers. Offers appear without any specified order. The objective is...
متن کاملGrouping Objects to Homogeneous Classes Satisfying Requisite Mass
Grouping datasets plays an important role in many scientific researches. Depending on data features and applications, different constrains are imposed on groups, while having groups with similar members is always a main criterion. In this paper, we propose an algorithm for grouping the objects with random labels, nominal features having too many nominal attributes. In addition, the size constra...
متن کاملA redundancy allocation problem with the choice of redundancy strategies by a memetic algorithm
This paper proposes an efficient algorithm based on memetic algorithm (MA) for a redundancy allocation problem without component mixing (RAPCM) in a series-parallel system when the redundancy strategy can be chosen for individual subsystems. Majority of the solution methods for the general RAPCM assume that the type of a redundancy strategy for each subsystem is pre-determined and known a prior...
متن کاملMinimax-Optimal Strategies for the Best-Choice Problem When a Bound is Known for the Expected Number of Objects
For the best-choice (or secretary) problem with an unknown number N of objects, minimax-optimal strategies for the observer and minimax distributions for N are derived under the assumption that N is a random variable with expected value at most M, where M is known. The solution is derived as a special case of the situation where N is constrained by Ef(N) <_ M, where f is increasing with f(i) f(...
متن کاملThe Relationship of Intelligence Beliefs, Self-Regulation, and Metacognition with School Refusal Behavior in Secondary High School Girl Students
This study aimed to determine the relationship of intelligence beliefs, academic self-regulation, and metacognition with school refusal behavior in female secondary high school students in Amol. This study was a descriptive one with correlational design. The statistical population consisted of all the female secondary high school students of which 214 students were selected using multistage ran...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009